Monte Carlo tree search
For balancing exploitation and exploration, UCT (Upper Confidence Bound applied to trees) algorithm was introduced by Levente Kocsis and Csaba Szepesvári. Kocsis, L. and Szepesvári, C., 2006, September. Bandit based monte-carlo planning. In European conference on machine learning (pp. 282-293). Springer, Berlin, Heidelberg. Related:
en.icon